Picture for Andrew Szot

Andrew Szot

From Multimodal LLMs to Generalist Embodied Agents: Methods and Lessons

Add code
Dec 11, 2024
Viaarxiv icon

ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI

Add code
Oct 03, 2024
Viaarxiv icon

Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge

Add code
Jul 09, 2024
Figure 1 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Figure 2 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Figure 3 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Figure 4 for Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
Viaarxiv icon

Reinforcement Learning via Auxiliary Task Distillation

Add code
Jun 24, 2024
Figure 1 for Reinforcement Learning via Auxiliary Task Distillation
Figure 2 for Reinforcement Learning via Auxiliary Task Distillation
Figure 3 for Reinforcement Learning via Auxiliary Task Distillation
Figure 4 for Reinforcement Learning via Auxiliary Task Distillation
Viaarxiv icon

Grounding Multimodal Large Language Models in Actions

Add code
Jun 12, 2024
Figure 1 for Grounding Multimodal Large Language Models in Actions
Figure 2 for Grounding Multimodal Large Language Models in Actions
Figure 3 for Grounding Multimodal Large Language Models in Actions
Figure 4 for Grounding Multimodal Large Language Models in Actions
Viaarxiv icon

Large Language Models as Generalizable Policies for Embodied Tasks

Add code
Oct 26, 2023
Figure 1 for Large Language Models as Generalizable Policies for Embodied Tasks
Figure 2 for Large Language Models as Generalizable Policies for Embodied Tasks
Figure 3 for Large Language Models as Generalizable Policies for Embodied Tasks
Figure 4 for Large Language Models as Generalizable Policies for Embodied Tasks
Viaarxiv icon

Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots

Add code
Oct 19, 2023
Figure 1 for Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots
Figure 2 for Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots
Figure 3 for Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots
Figure 4 for Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots
Viaarxiv icon

Skill Transformer: A Monolithic Policy for Mobile Manipulation

Add code
Aug 19, 2023
Figure 1 for Skill Transformer: A Monolithic Policy for Mobile Manipulation
Figure 2 for Skill Transformer: A Monolithic Policy for Mobile Manipulation
Figure 3 for Skill Transformer: A Monolithic Policy for Mobile Manipulation
Figure 4 for Skill Transformer: A Monolithic Policy for Mobile Manipulation
Viaarxiv icon

Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second

Add code
Jun 13, 2023
Figure 1 for Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second
Figure 2 for Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second
Figure 3 for Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second
Figure 4 for Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second
Viaarxiv icon

Adaptive Coordination in Social Embodied Rearrangement

Add code
May 31, 2023
Figure 1 for Adaptive Coordination in Social Embodied Rearrangement
Figure 2 for Adaptive Coordination in Social Embodied Rearrangement
Figure 3 for Adaptive Coordination in Social Embodied Rearrangement
Figure 4 for Adaptive Coordination in Social Embodied Rearrangement
Viaarxiv icon